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Korean Perspectives on Assessment of Student Achievement 


PARK, Hyun-Jeong* 


In this paper, two nationwide assessments for elementary and secondary 
students’ educational achievement in Korea were reviewed for their assessment 
frameworks and the overall results; the Diagnostic Test for Basic Skills (DTBS) 
of Grade 3 elementary students and the National Assessment of Educational 
Achievement (NAEA) for Grade 6, 9 and 10 students. Also, the results for Ko- 
rea in two large-scale international comparison studies were reviewed; the 
Trends in International Mathematics & Science Study (TIMSS) and the Pro- 
gramme for International Student Assessment (PISA). Finally, the author makes 
some suggestions for future research and policies on the student achievement 
from the Korean perspective. 


1 Introduction 

The Korean government has taken initiatives on the nationwide assessments of student 
achievement since the late 1980s. Currently, there are two nationwide tests targeting elementary 
and secondary school students in Korea; one is the Diagnostic Test for Basic Skills (DTBS) of the 
Grade 3 elementary students and the other is the National Assessment of Educational Achievement 
(NAEA) for the Grade 6, 9 and 10 students. Along with these two nationwide achievement tests, 
Korea has participated two international surveys on student achievement since 1994; one is the 
Trends in International Mathematics & Science Study (TIMSS) and the other is the Programme for 
International Student Assessment (PISA). In this paper, I would like to review the assessment 
framework and the results from these assessments currently in operation, and then make sugges- 
tions for future research and policies on the student achievement from Korean perspectives. 


2 Nationwide Assessments of Student Achievement in Korea 

The Diagnostic Test for Basic Skills (DTBS) of the Grade 3 elementary students started 
in 2002. The legal foundation for this test is contained in Article 9.1 of the elementary and sec- 
ondary education act, which states that the Ministry of Education can carry out tests to evaluate 
the achievement of students in school education. Besides this legal foundation, the DTBS is based 
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on the 7 th national curriculum and the school education normalization policy (Chae et al., 2003). 
The 7 th national curriculum in Korea states that the government can carry out evaluations of stu- 
dent achievement, schools and educational institutions, and planning and implementation of school 
curriculum periodically on the national level, in order to control the quality of national curriculum. 
Also, the school education normalization policy of March 2002 directly introduced the idea of test- 
ing Grade 3 elementary students on the 3R’s (reading, writing, arithmetic) to diagnose their basic 
skills. 

While this background of the introduction of DTBS can tell us much about the objectives 
of this test, the main objectives of the DTBS can be summarized as follows (Chae et al., 2003). 
First of all, the DTBS conducts a scientific evaluation of basic skills and provides necessary sup- 
port to the nation, local educational authorities and schools based on the results of this test in or- 
der to maintain a high level of academic achievement. Secondly, the DTBS confirms whether 
students have reached the minimum proficiency level for reading, writing, and arithmetic at the 
Grade 3 level, and the test produces various policy-based indicators for lower-grade elementary 
school education. Thirdly, the DTBS develops and offers remedial education programs to support 
students under the minimum proficiency level based on the analysis of their characteristics. 

To achieve these objectives, the Korea Institute for Curriculum and Evaluation (KICE), 
which is a management institution responsible for national achievement tests, has carried out the 
DTBS every October since 2002, and has distributed these results to students and schools the fol- 
lowing December. The results for individual students include whether the student has reached the 
minimum proficiency level in each subject area (reading, writing, arithmetic), as well as diagnostic 
information on the sub-areas classified into abilities and content. 

The other nationwide assessment utilized in Korea is the National Assessment of Educa- 
tional Achievement (NAEA). The focus of this paper is on the NAEA because it has been recent- 
ly in the center of educational arguments, while the DTBS has never been in the middle of 
controversy. The NAEA has been conducted with the nationally representative sample of Grade 6, 
9, and 10 students every October since 2000. The NAEA was designed to benchmark the Nation- 
al Assessment of Educational Progress (NAEP) in USA. The objectives of the NAEA are as fol- 
lows (Cho et al., 2007). The main objectives of the NAEA are to measure the educational 
achievement of elementary, middle, and high school students, and to analyze the trends of their 
achievement systematically and scientifically. The research design for the trend analysis of NAEA 
data was introduced in 2003, and the first report of trend analyses was published in 2006. In ad- 
dition to these main objectives, the NAEA provides reference data to improve the national curric- 
ulum by analyzing students’ achievement utilizing specific goals of the curriculum, and 
investigating the problems with curriculum implementation at school and classroom levels. In the 
process of analyzing test items and the relationship between students’ achievement and their back- 
ground variables, the NAEA also provides valuable information to improve teaching and learning 
methods, as well as necessary assistance to set up learning encouragement policies by the govern- 
ment. 

To achieve these objectives, KICE has prepared systematically for the NAEA since 1998. 
KICE launched the initiating plan for the NAEA in 1998 and administered the field tests in social 
studies in 1999. The first nationwide administration of the NAEA was conducted with a 0.5% sam- 
ple of the whole population for Grade 6, 9, and 10 students in October 2000. The sample sizes of 
the NAEA have increased since 2000; the samples were 1% of the 6 lh , 9 th and 10 th graders from 
2001 to 2003, 1% of the 6 th and 9 th graders and 3% of 10 th graders in 2004 and 2005, and 3% of 
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Figure 1 The process of developing the NAEA evaluation tools 
Reprinted from http://www.kice.re.kr/kice/eng/info/info_3.jsp 


the 6 th , 9 lh and 10 th graders in 2006 and 2007. 

For these three grades, students were tested on five subject matters, Korean language, so- 
cial studies, mathematics, science, and English. Since the tests are administered in October, the 
tests for 6 th graders include all the content covered from the 4 th to 6 th grades, the tests for 9 th grad- 
ers include all content covered from the 7 th to 9 th grades, and tests for 10 lh graders include all con- 
tent covered in the 10 th grade. The tests consist of constructed response items as well as 
multiple-choice items. The constructed response items comprise 20-40% of the total score depend- 
ing on the subject matter. Also, listening comprehension tests are included in both Korean and 
English. Along with theses achievement batteries, background questionnaires are administered to 
students, teachers and principals to investigate the relationship between the background variables 
and student academic achievement. 

The process of developing NAEA evaluation tools can be summarized as follows (KICE, 
2008). First, the national curriculum of each subject matter is analyzed to decide the assessment 
areas and to set up achievement standards for each area. The achievement standards are statements 
specifying the objectives and content of the national curriculum. Once the achievement standards 
are developed, assessment standards also need to be developed. The assessment standards are state- 
ments differentiating students’ levels of achievement (four levels of Advanced, Proficient, Basic, 
and Below-Basic) to use as criteria in assessment activities for each subject. Finally, assessment 
tools are developed through a process of reviewing the achievement and assessment standards, set- 
ting a guideline for item development, appointing and training item writers, designing an item writ- 
ing plan, developing items by the item writers, reviewing the items by reviewers, selecting the 
items for field test, conducting a field test, analyzing the results of the field test, revising and se- 
lecting items for a main test, and deciding the final assessment tools for NAEA. 

The results from trend analyses of the NAEA can be summarized as follows (Cho et al., 
2007). First of all, the variations of average standard scores from 2003 to 2006 were largest for 
the 6th graders. Especially for the English and science, the average scores showed a sharp increase 
until 2005 but changed directions and decreased in 2006. For the 9th graders, the average scores 
of English and social sciences increased in 2004, but steadily decreased afterwards, while the av- 
erage scores of other subjects continued to increase until 2005 but showed a decline in 2006. 
Among the 10th graders, the average scores for Korean, mathematics and science fluctuated from 
time to time, while the average scores for English showed a slow increase from 2003 to 2005 be- 
fore decreasing in 2006, and the average scores for social studies continued to slowly decline ever 
since 2003. In general, the average performance of students showed improvement until 2005, but 
decreased in 2006 for all grades. Comparing the years 2003 and 2006 the average achievement of 
6th graders improved in English, science and mathematics, while 9th graders showed improvement 


20 


PARK, Hyun-Jeong 




-♦-Korean — Social studies -^-Mathematics -^-Science —t— English 

Figure 2 Changes in the average standard scores by grades and subject matters from 2003 to 2006 
Note: For the subject of Korean, the referential year of the trend analysis was 2004 and not 2003. The standard devia- 
tions ranged from 7 to 11 depending on the subjects and years. The original figures from Cho et al. (2007) were re- 
organized by author. 

in mathematics and English, and 1 0th graders in Korean and English. However, the average scores 
for social studies decreased from 2003 to 2006 for the 10th graders. 

Secondly, in terms of proficiency levels, the proportion of students at the advanced level 
had generally increased for Grade 6 science, Grade 9 mathematics and Grade 10 Korean, while 
they had decreased for Grade 9 Korean, and Grade 10 Science. However, the proportion of stu- 
dents below basic level generally declined for Grade 6 mathematics, Grade 6 science, Grade 9 
mathematics, Grade 9 science, and Grade 10 Korean. In short, there tended to be a higher propor- 
tion of 6 grade students at the advanced level, there was a higher proportion of students below ba- 
sic levels in the 9th and 10th grades. For the 9th and 10th graders, the proportion of students below 
basic level was much higher in mathematics and science. The proportion of students below basic 
level was 19.8% for 10th graders in 2004. 

Thirdly, in terms of gender differences, girls had higher average scores than boys in Ko- 
rean, social studies, science, and English, while the mean difference between girls and boys fluc- 
tuated in mathematics for 6th graders. For 9th graders, girls had higher average scores in general 
across all the subjects except for mathematics. 10th grade girls had higher average scores in Ko- 
rean, social studies and English, and 10 grade boys had higher average scores in mathematics and 
science. Moreover, there was a higher proportion of girls at the advanced level across all 6 grade 
subjects. For both 9 th and 10 lh graders, there were more girls at the advanced level in Korean and 
English, and more boys at the advanced level in social studies, mathematics and science. Howev- 
er, there was a higher proportion of boys below the basic level across all the subjects for all 
grades. 

In addition to these general descriptions of student achievement, there was found to be a 
relationship between student achievement and background characteristics of the NAEA (Cho et al., 
2007). Private schools tended to have higher average scores in grades 6 and 10, while there was 
no significant difference in Grade 9. Schools with large class sizes had higher proportions of stu- 
dents in advanced levels across all grades, while schools with small class sizes tended to have 
higher proportions of students below basic level. This can be explained by the fact that many 
schools with large class sizes are located in the large cities, while ones with small class sizes are 
mostly in rural areas. Additionally, students whose teachers have high self-efficacy as teachers, high 


Korean Perspectives on Assessment of Student Achievement 


21 




■ Advanced I Proficient I Basic l| Below-Basic 

Figure 3 Changes in the student achievement levels by grades and subject matters from 2003 to 2006 
Note: For the subject of Korean, the reference year of the trend analysis was 2004 instead of 2003. The proportion of 
students below basic level is presented below the x axis. The original figures from Cho et al. (2007) were re-organized 
by the author. 

aspiration to teach and high expectations for student achievement tended to have higher achieve- 
ment scores. Also, students who have parents with higher educational experiences and who spend 
more time talking with parents tended to have higher achievement scores. Finally, significant pos- 
itive correlations with student achievement were found for self-regulated learning, school adapta- 
tion, student-teacher relations, positive self-concept and attitudes towards learning. 


3 International Assessment of Student Assessment 

In addition to these domestic assessments of educational achievement, Korea has also par- 
ticipated in two international comparison studies, the Trends in International Mathematics and Sci- 
ence Study (TIMSS) and the Programme for International Student Assessment (PISA). As a project 
of the International Association for the Evaluation of Educational Achievement (IEA), TIMSS pro- 
vides information to improve teaching and learning in mathematics and science. TIMSS assesses 
achievement in mathematics and science at Grades 4 and 8 and collects a rich array of background 
information to address concerns about school resources and the quality of school curriculum and 
instruction. Conducted every four years on a regular cycle from 1995, TIMSS provides countries 
with an unprecedented opportunity to measure progress in educational achievement in mathemat- 
ics and science. 

Korea has been participating in TIMSS since the first cycle of 1995. TIMSS was designed 
for 4 th and 8 th graders, but only 8 th graders in Korea have continuously participated in TIMSS from 
1995 to 2007, and only 4 th graders participated in 1995. The results of TIMSS can be summarized 
by the following (Martin et al, 2004; Mullis et al., 2004). The results of TIMSS from 1995 to 2003 
showed that Korean students performed well in mathematics and science when compared to other 
participating countries. In 2003, Korea ranked 2 nd with an average score of 589 in mathematics and 
ranked the 3 rd with an average score of 558 in science among 46 participating countries. Further- 
more, Korean students have showed an improvement with significant change over the 8-year pe- 
riod in both mathematics and science (average score of 581 to 587 and 589 score points for 
mathematics and 546 to 549 and 558 score points for sciences). However, the achievement of girls 
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was significantly lower than boys both in mathematics and science compared to other participating 
countries, even though these gender differences have continuously decreased since 1995. Despite 
the relatively strong performance of Korean students, they reported very low self-confidence in 
mathematics and science, as did Japanese students who took this test. 

Korea has also participated in another large-scale international comparison study, the Pro- 
gramme for International Student Assessment (PISA). PISA was designed and developed by the 
OECD in the late 1990s as an ongoing periodic international comparative study in order to collect 
policy-oriented indicators on the educational systems. PISA assesses 15-year-olds in school. Un- 
like TIMSS, it is an age-based survey, rather than a grade-based one. The choice of this population 
means that the assessment is targeted to measure the extent to which students are prepared for the 
daily challenges of adulthood in modern society since compulsory education ends at this age in 
most countries. In this regard, PISA measures competencies, which is termed as “literacy,” rather 
than what is taught directly in schools. The definition of literacy is concerned with the capacity of 
students to extrapolate from what they have learned and to analyze and reason as they pose, solve 
and interpret problems in a variety of situations (OECD, 2007a). PISA surveys have taken place 
every three years since 2000. Although each cycle assesses all three assessment domains (reading, 
mathematics and science), the focus of the survey shifts from domain to domain in rotation, so that 
detailed analyses are periodically available for each domain, and in-depth comparisons are possible 
every nine years. 

Korea has participated in PISA since the 1 st cycle of 2000. The results of PISA have pro- 


Table 1 Summary results of academic performance for Korean 15-year-olds 








Variance in student 

Between-school variance 




Average 

Percentage 


performance(SP) 


in SP explained by 

Domain 

Country 

Year 

scale 

score 

of students 
at level 5/6 

Total 

Between 

schools 

Within 

schools 

SES of 
students 

SES of 
students and 
schools 



2006 

556 

21.7 

80.2 

33.0 

48.7 

2.1 

14.6 


Korea 

2003 

534 

12.2 

75.2 

27.6 

48.9 

4.6 

16.7 

Reading 


2000 

525 

5.7 

54.5 

20.8 

34.6 

2.3 

10.4 

OECD 

2006 

492 

8.6 

100.0 

38.4 

63.4 

5.6 

21.5 


2003 

494 

8.3 

100.0 

31.1 

69.3 

7.6 

21.1 



2000 

500 

9.5 

100.0 

34.3 

67.4 

7.5 

21.6 



2006 

547 

27.1 

102.9 

41.9 

61.9 

6.0 

24.0 


Korea 

2003 

542 

24.8 

99.3 

42.0 

58.1 

7.7 

27.8 

Math 


2000 

547 

- 

84.2 

34.1 

50.7 

4.7 

20.8 

OECD 

2006 

498 

13.3 

100.0 

36.8 

64.6 

7.3 

21.9 


2003 

500 

14.6 

100.0 

33.0 

67.4 

8.3 

22.6 



2000 

500 

- 

100.0 

32.4 

68.6 

8.2 

20.5 



2006 

522 

10.3 

90.2 

31.8 

59.3 

3.8 

16.9 


Korea 

2003 

538 

- 

101.4 

38.9 

63.2 

5.8 

23.2 

Science 


2000 

552 

- 

74.2 

29.4 

45.9 

2.9 

15.7 

OECD 

2006 

500 

9.0 

100.0 

33.0 

68.1 

7.2 

20.5 


2003 

500 

- 

100.0 

29.0 

71.1 

8.0 

20.1 



2000 

500 

- 

100.0 

30.2 

70.5 

7.9 

19.3 


Note: The scale scores are standardized to have a mean of 500 and a standard deviation of 100 across OECD countries. They were 
then vertically scaled to allow for trend analysis. The information by level was reported from the year when the corresponding do- 
main was the main focus of study. The highest level was 5 for reading and 6 for math and science. Total variance in student per- 
formance (SP) was expressed as a percentage of the average variance in SP across OECD countries. The author created this table 
based on data found in OECD (2001), OECD (2004) and OECD (2007b). 
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vided many insights on the strengths and weaknesses of educational systems in Korea and have 
facilitated much research (Yun & Lee, 2006; Lee, 2007; Shin et ah, 2007; Park, 2008). The results 
of PISA have suggested that Korean students have performed relatively well compared to 15-year- 
olds in other participating countries. In PISA 2006, Korea ranked 1 st in reading literacy, 3rd in 
mathematics literacy, and 10 th in science literacy among 57 participating countries. As can be seen 
in Table 1, the performance in reading literacy was improved and mathematics performance re- 
mained stable, while science performance declined in PISA 2006 compared to PISA 2000 and PISA 
2003 (OECD, 2007a). Also, the percentage of students at the highest levels, level 5/6, was much 
higher than the OECD average, except for science, which was near the OECD average. 

Also, to examine the degree of educational inequality, we can look at the amount of total 
variance in student performance, the percentage of between-school variance among total variance, 
and the percentage of variance explained by SES of students and schools. The results of Table 1 
suggest that there exist some amount of educational inequality in Korean education depending on 
the subject domain. Reading seems to be the domain which has least amount of educational in- 
equality while mathematics seems to be the subject with most amount of educational inequality. 
Especially for mathematics, the percentage of between-school variance explained by SES of stu- 
dents and schools has been consistently above the OECD average, unlike other domains. This can 
be explained by hakkun effect in Korea which means that schools in wealthy areas tend to have 
higher academic performance. 

One final, but very important result of PISA was the negative attitudes toward mathematics 
and science held by Korean students. As can be seen in Table 2, 15-year-olds in Korea had quite 
negative attitudes toward mathematics and science, even though they showed relatively high aca- 
demic performances. Considering that these attributions can be regarded as foundations of lifelong 
learning, and that the relationship between these attributions and academic performance is quite 
positive, this phenomenon was taken quite serious and led to a variety of studies and policy chang- 
es to enhance student attitudes towards school learning. 


4 New Challenges for the Assessment of Student Achievement in Korea 

In Korea, the pursuit of higher educational attainment and entrance into prestigious schools 
has always been a priority of parents and students. To enter prestigious universities, Korean stu- 


Table 2 Students’ attitudes toward mathematics and science in relation to their academic performance 





Average performance of 

Percentage of explained variance 

Year 

Index 

Mean 

achievement scale 

in student performance 

index 

Bottom 

Top 

Korea 

OECD 




quarter 

quarter 

average 


interest and enjoyment of math 

-0.12 

500 

593 

15.5 

1.5 

2003 

self-concept in math 

-0.35 

493 

604 

21.4 

10.8 


self-efficacy in math 

-0.42 

469 

617 

33.2 

22.7 


general interest in science 

-0.24 

482 

567 

13.0 

7.2 

2006 

enjoyment of science 

-0.17 

452 

574 

16.7 

10.2 

self-concept in science 

-0.71 

487 

569 

12.6 

8.8 


self-efficacy in science 

-0.21 

477 

563 

14.2 

15.9 


Note: The indexes were standardized to have a mean of 0 and standard deviation of 1 across OECD countries. This table was re- 
organized by author what were listed in OECD (2004) and OECD (2007b). 
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dents spend quite a lot of time for study at schools as well as at private tutoring industries from 
the early stage of school education. Considering these efforts of students and parents, the Korean 
government has made little attempt to monitor the educational progress of students and to diagnose 
and support students who are below the basic proficiency level. International studies such as 
TIMSS and PISA have introduced the need for the systematic assessment of student achievement, 
and are specifically designed for trend analysis, along with a variety of contextual questionnaires. 
Stimulated by the design and results of TIMSS and PISA, the Korean government introduced the 
National Assessment of Educational Achievement (NAEA) and the Diagnostic Test for Basic Skills 
(DTBS) in the early 2000s. 

The results of these tests and those of the international studies have aroused people’s atten- 
tion to some concerns about Korean education, such as the relatively low performance in science, 
relatively large educational inequalities in mathematics, and students’ negative attitudes towards 
school learning (Yun & Lee, 2006; Cho et al., 2007; Shin et al., 2007; Park, 2008). In spite of 
many existing studies, more studies are still needed to explain what makes schools perform well 
in Korean context, how students’ attitudes toward school learning actually function to influence 
academic performances, and how to enhance students’ attitudes toward school learning. Further- 
more, more methodological research is needed to determine the cut-off scores for proficiency lev- 
els, vertical scaling method, and for identification of items with bias. 

Recently, due to a newly introduced law, great changes are expected in the systems for the 
nationwide assessment of student achievement in Korea. In 2008, the Information Announcement 
Act on Educational Institutions was passed. According to this new law, all elementary and second- 
ary schools are required to participate in the national assessment of student achievement such as 
the NAEA and to announce the results in public (Kim et al., 2007). 

To satisfy this requirement, the Korean government changed the NAEA from a sample sur- 
vey to a population survey. From 2008, all 6 th , 9 th and 10 th graders in all elementary and secondary 
schools are required to participate in the NAEA. This is expected to change the nature of the test. 
Until now, the NAEA was not a high-stake test. The students and schools did not respond to this 
test sensitively because the results were only reported to the individual students and schools and 
were never used as a tool for school evaluation. Because of this nature of test, there was little wor- 
ry about item exposure and it was easy to introduce common items for vertical scaling. However, 
it is now expected that schools will take initiatives to systematically prepare for this test in an at- 
tempt to make their schools look better. It will also make the items quite vulnerable for exposure 
and introduce more methodological issues for field testing and vertical scaling. There also has been 
a considerable amount of disagreement over how to develop a computerized scoring system for 
constructed response items, the need for reducing testing areas to fewer than five subjects, and 
changing the respondents’ grades from the 6 th , 9 th and 10 lh to other grades (Kim, 2008; Cheong, 
2008). 

It is quite certain that changing from a sample survey to a population survey and becom- 
ing a high-risk test will introduce some new issues and conflicts in the NAEA. However, we al- 
ways need to be cognizant of the reasons why we are doing these assessments. Essentially, it is 
important to continuously monitor the educational progress of the nation as well as schools, and 
we must find more in-depth answers for the conclusions made in international studies by bridging 
the results of these studies with domestic ones. 
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